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Foreword 



rd , 



This Technical Specification (TS) has been produced by the 3 Generation Partnership Project (3GPP). 

The contents of the present document are subject to continuing work within the TSG and may change following formal 
TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an 
identifying change of release date and an increase in version number as follows: 

Version x.y.z 

where: 

X the first digit: 

1 presented to TSG for information; 

2 presented to TSG for approval; 

3 or greater indicates TSG approved document under change control. 

y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, 
updates, etc. 

z the third digit is incremented when editorial only changes have been incorporated in the document. 
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1 Scope 



The present document contains an electronic copy of the ANSI-C code for the Adaptive Multi-Rate codec. The ANSI-C 
code is necessary for a bit exact implementation of the Adaptive Multi Rate speech transcoder (TS 26.090 [2]), Voice 
Activity Detection (TS 26.094 [6]), comfort noise (TS 26.092 [4]), source controlled rate operation (TS 26.093 [5]) and 
example solutions for substituting and muting of lost frames (TS 26.091 [3]). 



References 



The following documents contain provisions which, through reference in this text, constitute provisions of the present 
document. 

• References are either specific (identified by date of publication, edition number, version number, etc.) or 
non-specific. 

• For a specific reference, subsequent revisions do not apply. 

• For a non-specific reference, the latest version applies. 

[1] TS 26.074: "AMR Speech Codec; Test sequences". 

[2] TS 26.090: "AMR Speech Codec; Speech transcoding". 

[3] TS 26.091: "AMR Speech Codec; Substitution and muting of lost frames". 

[4] TS 26.092: "AMR Speech Codec; Comfort noise aspects". 

[5] TS 26.093: "AMR Speech Codec; Source controlled rate operation". 

[6] TS 26.094: "AMR Speech Codec; Voice Activity Detection". 



3 Definitions and abbreviations 

3.1 Definitions 

Definition of terms used in the present document, can be found in TS 06.090 [2], TS 06.091 [3], TS 06.092 [4], 
TS 06.093 [5] and TS 06.094 [6]. 

3.2 Abbreviations 

For the purpose of the present document, the following abbreviations apply: 

ANSI American National Standards Institute 

ETS European Telecommunication Standard 

GSM Global System for Mobile communications 

I/O Input/Output 

RAM Random Access Memory 

ROM Read Only Memory 
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C code structure 



This clause gives an overview of the structure of the bit-exact C code and provides an overview of the contents and 
organization of the C code attached to this document. 

The C code has been verified on the following systems: 

Sun Microsystems workstations and GNU gcc compiler; 

DEC Alpha workstations and GNU gcc compiler; 

IBM PC/AT compatible computers with Linux operating system and GNU gcc compiler. 
ANSI-C 9899 was selected as the programming language because portability was desirable. 

4.1 Contents of the C source code 

The C code distrubution has all files in the root level. 

The distributed files with suffix "c" contain the source code and the files with suffix "h" are the header files. The ROM 
data is contained mostly in files with suffix "tab". 

The C code distribution also contains one speech coder installation verification data file, "spch_dos.inp". The reference 
encoder output file is named "spch_dos.cod", the reference decoder input file is named "spch_dos.dec" and the 
reference decoder output file is named "spch_dos.out". These four files are formatted such that they are correct for an 
IBM PC/AT compatible computer. The same files with reversed byte order of the 16 bit words are named 
"spch_unx.inp", "spch_unx.cod", "spch_unx."dec" and "spch_unx.out", respectively. 

Final verification is to be performed using the GSM Adaptive Multi-Rate test sequences described in GSM 06.74 [2]. 

Makefiles are provided for the platforms in which the C code has been verified (listed above). Once the software is 
installed, this directory will have a compiled version of encoder and decoder (the bit-exact C executables of the speech 
codec) and all the object files. 

4.2 Program execution 

The GSM Adaptive Multi-Rate codec is implemented in two programs: 

{encoder) speech encoder; 

{decoder) speech decoder. 
The programs should be called like: 

encoder [encoder options] <speech input file> <parameter file>; 

decoder [decoder options] <parameter file> <speech output file>. 

The speech files contain 16-bit linear encoded PCM speech samples and the parameter files contain encoded speech 
data and some additional flags. 

The encoder and decoder options will be explained by running the applications with option -h. See the file readme.txt 
for more information on how to run the encoder and decoder programs. 
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4.3 Coding style 



The C code is written according to the following structuring conventions. Each function func() that needs static 
variables is considered a module. A module consists of: 

a 'state structure' (struct) combining the static variables of the module; 

three auxiliary functions func_init(), func_reset(), and func_exit(); 

the processing function func() itself. 

The initialization function func_init() allocates (from the heap) a new state structure, calls the func_reset() function, 
stores the pointer to the newly allocated structure in its first function parameter, and returns with a value of if 
completed successful or a value of 1 otherwise. 

The reset function func_reset() takes a pointer to the state structure and resets all members of the structure to a 
predefined value ('homing'). 

The exit function func_exit() performs any necessary cleanup and frees the state structure memory. 

The processing function func() also takes a pointer to the state structure as well as all other necessary parameters and 
performs its task using (and possibly modifying) the values in the state structure. 

If a module calls other modules, the higher level state structure contains a pointer to the lower level state structures, and 
the init, reset, and exit functions recursively call the corresponding lower level functions. 

By this convention, the code becomes "instantiable" (more than one copy of a module can be used in the same program) 
and the static data hierarchy is clearly visible in the code. 



4.4 Code hierarchy 



Figures 1 to 4 are call graphs that show the functions used in the speech codec, including the functions of VAD, DTX, 
and comfort noise generation. 

Each column represents a call level and each cell a function. The functions contain calls to the functions in rightwards 
neighboring cells. The time order in the call graphs is from the top downwards as the processing of a frame advances. 
All standard C functions: printf(), fwrite(), etc. have been omitted. Also, no basic operations (add(), L_add(), mac(), 
etc.) or double precision extended operations (e.g. L_Extract()) appear in the graphs. The initialization of the static 
RAM (i.e. calling the _init functions) is also omitted. 

The basic operations are not counted as extending the depth, therefore the deepest level in this software is level 7. 

The encoder call graph is broken down into three separate call graphs. Table 1 to 3. 
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Table 1 : Speech encoder call structure 



Speech_Encocie_Frame 


Pre Process 










cod__amr 


Copy 




Vadi'' 


filter_bank 


first_filter_stage 




filters 




filters 




level calculation 




vad_decision 


complex estimate adapt 




complex vad 




noise estimate update 


update cntrl | 


hangover addition 




Vad2'' 


block_norm 

rjft 

fnlOLoglO 

Pow2 


c fft 
Log2 










Log2 norm 






tx dtx handler 




Ipc 


Autocorr 




Lag window 




Levinson 




Isp 


Az Isp 


Chebps 




Q_plsf_5 


Lsp Isf 




Lsf wt 




Vq subvec 




Vq subvec s 




Reorder Isf 




Lsf Isp 




Int Ipc lands 2 


Lsp az 


Get lsp_pol 


Int Ipc lands 


Lsp az 


Get Isp pol 


Q_plsf_3 


Lsp Isf 




Lsf wt 




Copy 




Vq subvecS 




Vq subvec4 




Reorder Isf 




Lsf Isp 




Int Ipc 1to3 2 


Lsp az 


Get lsp_pol 


Int Ipc 1to3 


Lsp az 


Get lsp_pol 


Copy 






dtx_buffer 


Copy 




Log2 


Log2 norm 




dtx_enc 


Lsp Isf 






Reorder Isf 




Lsf Isp 




Set zero 






lsp_reset 


Copy 




Q pisf reset 




ci Itp reset 


Pitch fr reset 




check Isp 






pre_big 


Weight Ai 




Residu 




Syn tilt 




oljtp 


Pitch_ol 


vad tone detection update^ 




Lag_max 


vad tone detection^ 


Inv sqrt 


comp corr^ 




hp max^ 




vad complex detection update^ 




Pitch_ol_wgh 


comp corr^ 




Lag_max2 


vad tone detection update^ 


vad tone detection^ 


gmed n 




hp max^ 




vad complex detection update^ 




vad_pitch_detection 


LTP flag update^ 






subframePreProc 


Weight Ai 




Syn tilt 




Residu 




Copy 




cljtp 


Pitchjr 

(continued) 


getRange 




Norm_Corr 


Convolve 


Inv sqrt 


search Frac 


Interpol 3or6 


Enc lags 




Enc Iag6 











1 Option to call one or the other VAD option 

2 Specific to VAD option 1 

3 Specific to VAD option 2 



ETSI 



3G TS 26.073 version 3.1.0 Release 1999 



ETSI TS 126 073 V3.1.0 (2000-03) 



Table 1 (concluded): Speech encoder call structure 









Pred It 3or6 


Convolve 


G pitch 


check qp clipping 


q gain pitch 


cbsearch 


see Table 2 


gainQuant 


see Table 3 


update gp clipping 


Copy 


subframePostProc 


Syn tilt 


Pred It 3or6 




Convolve 




Prm2bits 


Int2bin 





Table 2: cbsearch call structure 



Cbsearch 


cod e_2i40_9 bits 


cor h X 




set sign 




cor h 


Inv sqrt 


search 2140 




build code 




code_2i40_1 1 bits 


cor h X 




set sign 




cor h 


Inv sqrt 


search 2140 




build code 




code_3i40_14bits 


cor h X 




set sign 




cor h 


Inv sqrt 


search 3140 




build code 




code_4i40_1 7bits 


cor h X 




set sign 




cor h 


Inv sqrt 


search 4140 




build code 




code_8i40_31bits 


cor h X 




set sign12k2 


Inv sqrt 


cor h 


Inv sqrt 


search 10and8i40 




build code 




compress code 


compresslO 


code_1 0i40_35bits 


cor h X 




set sign12k2 


Inv sqrt 


cor h 


Inv sqrt 


search 10and8i40 




build code 




q_P 





Table 3: gainQuant call structure 



gainQuant 


go pred copy 


Copy 






gc_pred 


Log2 


Log2 norm 




Log2 norm 






calc tilt energies 






calc target energy 




MR475 update unq pred 


gc pred update 




MR475_gain_quant 


MR475_quant_store_results 


Log2 


Log2 norm | 


gc pred update 




gc_pred 


Log2 


Log2 norm | 


Log2 norm 




G code 








q gain code 


Pow2 




MR795_gain_quant 


q gain pitch 




MR795 gain code quant3 




calc unfilt energies 


Log2 


Log2 norm | 


gain adapt 


gmed n 




MR795 gain code quant mod 


sqrt 1 exp 




Qua gain 


Pow2 






gc_pred_update 
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Table 4: Speech decoder call structure 



Speech_Decode_Frame 



Bits2prm 



Decode r_amr 



Post_Process 



Bin2int 



rx_dtx_handler 



Decode r_amr_reset 



LsLIsp 



l5p_avg 



Copy 



D_pl5f_3 



lnt_lpc_1to3 



D_plsf_5 



Int_lpc_1and3 



Dec_lag3 



Pred_lt_3or6 



Dec_lag6 



decode_2i40_9bits 



decode_2i40_11bits 



decode_3i40_14bits 



decode_4i40_1 7bits 



decode_8i40_31bits 



ec_gain_pitch 



d_gain_pitch 



ec_gain_pitch_update 



decode_10i40_35bits 



Dec_gain 



ec_gain_code 



ec_gain_code_update 



d_gain_code 



Intjsf 



Cb_gain_average 



ph_disp_release 



ph_disp_lock 



ph_disp 



sqrt_l_exp 



Ex_ctrl 



agc2 



Synjilt 



Bgn_scd 



dtx_dec_activity_update 



lsp_avg 



Copy 



Weight_Ai 



Residu 



Set_zero 



Synjilt 



Preemphasis 



age 



lsp_avg_reset 



D_plsf_re5et 



ec_gain_pitch_reset 



ec_gain_code_reset 



3C_pred_reset 



Bgn_scd_reset 



ph_di5p_reset 



dtx_dec_reset 



Copy 



Lsfjsp 



lnit_D_plsf_3 



D_plsf_3 



pseudonoise 



Lspjsf 



Reorderjsf 



Lsp_Az 



_Refl 



Log2 



Build_CN_code 



Synjilt 



Set_zero 



Copy 



Copy 



Reorderjsf 



Copy 



Lsfjsp 



GetJsp_pol 



Log2_norm 



pseudonoise 



Reorderjsf 



Copy 



Lsfjsp 



Lsp_Az 



Reorderjsf 



Copy 



Lsfjsp 



Lsp_Az 



GetJsp_pol 



GeUsp_pol 



decompress_code 



gmed_n 



decompresslO 



Log2 



gc_pred 



Pow2 



gc_pred_update 



gmed_n 



gc_pred_average_limeted 
gc_pred_update 



Log2_norm 



Log2 



Log2_norm 



Log2_norm 



gc_pred 



Pow2 



gc_pred_update 



Log2 



Log2_norm 



Log2_norm 



gmed_n 



lnv_sqrt 



gmed_n 



Copy 



Log2 



Log2_norm 



energy_old 



energynew 



lnv_sqrt 



energy_old 
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4.5 Variables, constants and tables 

The data types of variables and tables used in the fixed point implementation are signed integers in 2's complement 
representation, defined by: 

- Wordl6 16 bit variable; 

- Word32 32 bit variable. 

Fiu-thermore some enum types are used, all possible to represent with one byte, and a boolean Flag. 

4.5.1 Description of constants used in the C-code 

This subclause contains a listing of all global constants defined in cnst.h. 

Table 5: Global constants 



Constant 


Value 


Description 


L TOTAL 


320 


total size of speech buffer. 




L WINDOW 


240 


window size in LP analysis 




L FRAME 


160 


frame size 




L FRAME BY2 


80 


frame size divided by 2 




L SUBFR 


40 


subframe size 




L CODE 


40 


codevector lengtli 




NB TRACK 


5 


number of tracks 




STEP 


5 


codebook step size 




NB TRACK MR102 


4 


number of tracks mode mr1 02 




STEP MR102 


4 


codebook step size mode mr1 02 




M 


10 


order of LP filter 




MP1 


(M+1) 


order of LP filter + 1 




LSF GAP 


205 


minimum distance between LSF after quantization 


50 Hz = 205 


LSP PRED FAC MR122 


21299 


MR122 LSP prediction factor (0.65 015) 




AZ SIZE 


44 


size of array of LP filters in 4 subframes (4*M+4) 




PIT MIN MR122 


18 


minimum pitch lag (MR122 mode) 




PIT MIN 


20 


minimum pitch lag (all other modes) 




PIT MAX 


143 


maximum pitch lag 




L INTERPOL 


(10+1) 


length of filter for interpolation 




L INTER SRCH 


4 


length of filter for CL LTP search interpolation 




MU 


26214 


factor for tilt compensation filter 0,8 




AGC FAC 


29491 


factor for automatic gain control 0,9 




L NEXT 


40 


overhead in LP analysis 




SHARPMAX 


13017 


maximum value of pitch sharpening 




SHARPMIN 





minimum value of pitch sharpening 




MAX PRM SIZE 


57 


max. num. of params 




MAX SERIAL SIZE 


244 


max. num. of serial bits 




GP CLIP 


15565 


pitch gain clipping = 0.95 




N FRAME 


7 


old pitch gains in average calculation 




EHF MASK 


8 


16 bit representation of all samples in the encoder 
justification) 


homing frame (left 



4.5.2 Description of fixed tables used in the C-code 

This section contains a listing of all fixed tables sorted by source file name and table name. All table data is declared as 
Wordl6. 
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Table 6: Fixed tables 



File 


Table name 


Length 


Description 


c2_9pf.c 


trackTable 


4*5 


track table for algebraic code book search (MR475, MR515) 


cod_amr.c 


gammal 


10 


spectral expansion factors 


cod_amr.c 


gamma1_12k2 


10 


spectral expansion factors 


cod_amr.c 


gamma2 


10 


spectral expansion factors 


dtxdec.c 


lsf_hist_mean_scale 


10 


initialization values for DTX Isf parameters 


dtxdec.c 


dtx_log_en_adjust 


9 


level adjustments for ech mode 


ec_gains.c 


cdown 


7 


attenuation factors for codebook gain 


ec_gains.c 


pdown 


7 


attenuation factors for adaptive codebook gain 


gc_pred.c 


pred 


4 


algebraic code book gain IVIA predictor coefficients 


gc_pred.c 


pred_MR122 


4 


algebraic code book gain IVIA predictor coefficients (MR122) 


pitch_fr.c 


mode_dep_parm 


72 


parameters defining the adaptive codebook search per mode 


post_pro.c 


a 


3 


HP filter coefficients (denominator) in Post_Process 


post_pro.c 


b 


3 


HP filter coefficients (numerator) in Post_Process 


pre_proc.c 


a 


3 


HP filter coefficients (denominator) in Pre_Process 


pre_proc.c 


b 


3 


HP filter coefficients (numerator) in Pre_Process 


predjt.c 


inter_6 


61 


interpolation filter coefficients 


pstfilt.c 


gamma3_MR122 


10 


spectral expansion factors 


pstfilt.c 


gammas 


10 


spectral expansion factors 


pstfilt.c 


gamma4_MR122 


10 


spectral expansion factors 


pstfilt.c 


gamma4 


10 


spectral expansion factors 


bitno.tab 


prmno 


9 


number of bits for each mode 


bitno.tab 


prmnofsf 


8 


number of parameters for LPC and first subframe for each mode (used for 
decoder homing procedure) 


bitno.tab 


bitno 


9 


pointers to the bitno_MR... tables 


bitno.tab 


bitno MR475 


17 


number of bits per parameter to transmit (MR475) 


bitno.tab 


bitno MR515 


19 


number of bits per parameter to transmit (MR51 5) 


bitno.tab 


bitno MR59 


19 


number of bits per parameter to transmit (MR59) 


bitno.tab 


bitno MR67 


19 


number of bits per parameter to transmit (IVIR67) 


bitno.tab 


bitno_MR74 


19 


number of bits per parameter to transmit (MR74) 


bitno.tab 


bitno MR795 


23 


number of bits per parameter to transmit (MR795) 


bitno.tab 


bitno MR102 


39 


number of bits per parameter to transmit (MR1 02) 


bitno.tab 


bitno MR122 


57 


number of bits per parameter to transmit (IVIR122) 


bitno.tab 


bitno_MRDTX 


5 


number of bits per parameter to transmit (IVIRDTX) 


c2 Hpf.tab 


startPosI 


2 


track start search position for first pulse 


c2_1 1 pf .tab 


startPos2 


4 


track start search position for second pulse 


c2_9pf.tab 


startPos 


16 


track start search position 


corrwght.tab 


corrweight 


251 


weighting of the correlation function in open loop LIP search (MR102) 


d_homing.tab 


dhf 


8 


pointers to the dhf_MR... tables 


d_homing.tab 


dhf MR475 


17 


parameter values for the decoder homing frame (MR475) 


d_homing.tab 


dhf MR515 


19 


parameter values for the decoder homing frame (MR515) 


d_homing.tab 


dhf MR59 


19 


parameter values for the decoder homing frame (MR59) 


d_homing.tab 


dhf MR67 


19 


parameter values for the decoder homing frame (MR67) 


d_homing.tab 


dhf MR74 


19 


parameter values for the decoder homing frame (MR74) 


d_homing.tab 


dhf MR795 


23 


parameter values for the decoder homing frame (MR795) 


d_homing.tab 


dhf MR102 


39 


parameter values for the decoder homing frame (MR1 02) 


d_homing.tab 


dhf_MR122 


57 


parameter values for the decoder homing frame (MR122) 


gains.tab 


qua_gain_pitch 


16 


adaptive codebook gain quantization table (MR122, MR795) 


gains.tab 


qua_gain_code 


96 


fixed codebook gain quantization table (l\/IR122, MR795) 


gray.tab 


gray 


8 


gray coding table 


gray.tab 


dgray 


8 


gray decoding table 


grid.tab 


grid 


61 


grid points at wich Chebyshev polynomials are evaluated 


inter_36.tab 


inter 6 


25 


interpolation filter coefficients 


inv_sqrt.tab 


table 


49 


table used in inverse square root computation 


lag_wind.tab 


lag_h 


10 


high part of the lag window table 


lag wind.tab 


lag 1 


10 


low part of the lag window table 



(continued) 
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Table 6 (concluded): Fixed tables 



File 


Table name 


Length 


Description 


log2.tab 


table 


33 


table used inbase 2 logharithm computation 


Isp.tab 


Isp init data 


10 


initialization table for Isp history in DTX 


Ispjsf.tab 


table 


65 


table to compute cos(x) in LsfJspO 


Ispjsf.tab 


slope 


64 


table to compute acos{x) in Lspjsf() 


ph_disp.tab 


ph_imp_low_MR795 


40 


phase dispersion impulse response (MR795) 


ph_disp.tab 


ph imp mid IVIR79 

5 

phjmpjow 


40 


phase dispersion impulse response (MR795) 


ph_disp.tab 


40 


phase dispersion impulse response (MR475 - MR67) 


ph_disp.tab 


ph imp mid 


40 


phase dispersion impulse response (IVIR475 - MR67) 


pow2.tab 


table 


33 


table used in 2 to the power computation 


q_plsf_3.tab 


past_rq_init 


80 


initialization table for the MA predictor in DTX 


q_plsf_3.tab 


meanjsf 


10 


LSF means (not in MR122) 


q_plsf_3.tab 


pred_fac 


10 


LSF prediction factors (not in MR122) 


q_plsf_3.tab 


dicoljsf 


3*256 


1"" LSF quantizer (not in l\/IR122 and MR795) 


q_plsf_3.tab 


dico2_lsf 


3*512 


2"'^ LSF quantizer (not in MR122) 


q_plsf_3.tab 


dico3_lsf 


4*512 


3''' LSF quantizer (not in MR1 22, MR51 5 and MR475) 


q_plsf_3.tab 


mr515 3 Isf 


4*128 


3''' LSF quantizer (MR515 and MR475) 


q_plsf_3.tab 


mr795_1_lsf 


3*512 


1" LSF quantizer (MR795) 


q_plsf_5.tab 


meanjsf 


10 


LSF means (MR1 22) 


q_plsf_5.tab 


dicoljsf 


4*128 


1^' LSF quantizer (IVIR1 22) 


q_plsf_5.tab 


dico2Jsf 


4*256 


2""* LSF quantizer (MR1 22) 


q_plsf_5.tab 


dico3Jsf 


4*256 


3'''' LSF quantizer (MR1 22) 


q_plsf_5.tab 


dico4Jsf 


4*256 


4* LSF quantizer (MR1 22) 


q_plsf_5.tab 


dicoSJsf 


4*64 


5* LSF quantizer (MR1 22) 


qgain475.tab 


table_gain_MR475 


4*256 


gain quantization table (IVIR475) 


qua_gain.tab 


table_gain_highrate 

s 

table gain lowrates 


128*4 


gain quantization table (IVIR67, MR74 and MR102) 


qua gain.tab 


64*4 


gain quantization table (MR515 and MR59) 


R fft.c 


phsjbl 


128 


sine/cosine phase table 


RJft.c 


ii table 


8 


indexing table 


sqrt 1 


table 


49 


table to compute sqrt(x) 


Vadl.c 


ch tbi 


2*16 


channel energy combination table 


Vadl.c 


chjbl_sh 


16 


channel energy scaling table 


Vad1 .c 


vmjbl 


90 


voice metric table 


Vadl.c 


hangoverjable 


20 


used to determine hangover as a function of SNR 


Vadl.c 


burstcountjable 


20 


used to determine burst count threshold as a function of SNR 


Vadl.c 


vmjhreshjable 


20 


used to determine the voice metric threshold as a function of SNR 


Vadl.c 


energy state tables 


2*6 


constants as a function of scaling state 


window.tab 


window_200_40 


240 


LP analysis window (not in MR122) 


window. tab 


window_1 60_80 


240 


1" LP analysis window (MR122) 


window.tab 


window 232 8 


240 


2"'^ LP analysis window (MR1 22) 



4.5.3 Static variables used in tine C-code 

In this section two tables that specify the static variables for the speech encoder and decoder respectively are shown. All 
static variables are declared within a C struct. 
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Table 7: Speech encoder static variables 



Struct name 


Variable 


Type[Length] 


Description 


Speech_Encode_ 


cod_amr_state 


cod_amrState 


see below in this table 


FrameState 










pre state 


Pre ProcessState 


see below in this table 




dtx 


Flag 


Is set if DTX functionality is used 




complexityCounter 


int 


Used for wMOPS counting 


Pre_ProcessState 


y2 hi 


Word 16 


filter state, upper word 




y2 lo 


Word 16 


filter state, lower word 




y1 hi 


Word 16 


filter state, upper word 




y1 lo 


Word 16 


filter state, lower word 




xO 


Word 16 


filter state 




x1 


Word 16 


filter state 


cod_amrState 


old_speech 


Word16[320] 


speech buffer 




speech 


Word 16* 


pointer to current frame in old_speech 




p_window 


Word 16* 


pointer to LPC analysis window in old_speech 




p_window_12l<2 


Word 16* 


pointer to LPC analysis window with no lookahead 
inold_speech (MR122) 




new_speech 


Word 16* 


pointer to the last 160 speech samples in 
old_speech 




old_wsp 


Word16[303] 


buffer holding spectral weighted speech 




wsp 


Word 16* 


pointer to the current frame in old_wsp 




oldjags 


Word16[5] 


open loop LTP states 




ol_gain_flg 


Word16[2] 


enables open loop pitch lag weighting {MR102) 




old_exc 


Word16[314] 


excitation vector 




exc 


Word 16* 


current excitation 




ai_zero 


Word16[51] 


history of weighted synth. filter followed by zero 
vector 




zero 


Word 16* 


zero vector 




h1 


Word 16* 


impulse response of weighted synthesis filter 




hvec 


Word16[80] 


zero vector followed by impulse response 




IpcSt 


IpcState 


see below in this table 




IspSt 


IspState 


see below in this table 




cILtpSt 


cILtpState 


see below in this table 




gainQuantSt 


gainQuantState 


see below in this table 




pitchOLWghtSt 


pitchOLWghtState 


see below in this table 




tonStabSt 


tonStabState 


see below in this table 




vadSt 


vadStatel 


see below in this table 




vadSt 


vadState2 


see below in this table 




dtx 


Flag 


is set if DTX functionality is used 




dtx_encSt 


dtx encState 


see below in this table 




mem_syn 


Word16[10] 


synthesis filter memory 




mem_wO 


Word16[10] 


weighting filter memory (applied to error signal) 




mem_w 


Word16[10] 


weighting filter memory (applied to input signal) 




mem_err 


Word16[50] 


filter memory for production of error vector 




error 


Word 16* 


error signal (input minus synthesized speech) 




sharp 


Word 16 


pitch sharpening gain 


vadStatel 


bckrest 


Word16[9] 


background noise estimate 




avejevel 


Word16[9] 


averaged input components for stationary 
estimation 




oldjevel 


Word16[9] 


input levels of the previous frame 




subjevel 


Word16[9] 


input levels calculated at the end of a frame 
(lookahead) 




a_data5 


Word16[6] 


memory for the filter bank 




a_data3 


Word16[5] 


memory for the filter bank 




burst_count 


Word 16 


counts length of a speech burst 




hang_count 


Word 16 


hangover counter 




stat_count 


Word 16 


stationary counter 




vadreg 


Word 16 


1 5 flags for intermediate VAD decisions 




pitch 


Word 16 


1 5 flags for pitch detection 




tone 


Word 16 


1 5 flags for tone detection 




complex_high 


Word 16 


flags for complex detection 




complexjow 


Word 16 


flags for complex detection 




oldlag_count 


Word 16 


variables for pitch detection 




oldlag 


Word 16 


variables for pitch detection 




complex_hang_count 


Word 16 


complex hangover counter, used by VAD 
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Struct name 


Variable 


Type[Lengtli] 


Description 


vadState2 


complex_hang_timer 

best_corr_hp 

speech_vad_decision 

complex_warning 

sp_burst_count 

corr_hp_fast 

pre_emp_mem 

update_cnt 

hyster_cnt 

last_update_cnt 

ch_enrg_long_db 

Lframe_cnt 

Lch_enrg 

Lch_noise 

last_normb_shift 

tsnr 

hangover 
burstcount 
fupdatejiag 

negSNRvar 

negSNRbias 

shift state 

L_RO 

L Rmax 

LTPJIag 


Word 16 

Word 16 
Word 16 
Word 16 
Word 16 
Word 16 
Word 16 
Word 16 
Word 16 
Word 16 
Word16[16] 
Word32 
Word32[1 6] 
Word32[1 6] 
Word 16 

Word 16 
Word 16 
Word 16 
Word 16 

Word 16 
Word 16 
Word 16 
Word32 
Word32 
Flag 


hangover initiator, used by GAD 

filtered value 

final decision 

complex background warning 

counts length of a speech burst inci HO addition 

filtered value 

input pre-emphasis memory 

noise update counter 

hysteresis counter 

noise update counter value for last frame 

long term channel energy in dB 

10 ms frame counter 

channel energy estimate 

channel noise estimate 

block shift factor for last frame, used for 

pre_emp_mem 

total estimated peak SNR in dB 

VAD hangover 

number of consecutive voice active frames 

A flag to control a forced update of the noise 

estimate 

SNR variability 

sensitivity bias 

indicates scaling state of channel energy estimate 

LTP energy 

LTP max correlation 

set when open loop pitch prediction gain > 

threshold 


dtx_encState 


lsp_hist 
log_en_hist 
hist_ptr 
log_enJndex 

initjsf_vqjndex 
Ispjndex 
dtxHangoverCount 
decAnaElapsedCount 


Word16[80] 
Word16[8] 
Word 16 
Word 16 

Word 16 
Word16[3] 
Word 16 
Word 16 


LSP history (8 frames) 
logarithmic frame energy history (8 frames) 
pointer to the cyclic history vectors 
Index for logarithmic energy 

initial index for Isf predictor 
Isp indecies to the three code books 
is decreased in DTX hangover period 
counter for elapsed speech frames in DTX 


IpcState 


LevinsonSt 


LevinsonState 


see below 


LevinsonState 


old A 


Word16[11] 


last frames direct form coefficients 


IspState 


lsp_old 
Isp old q 
qSt 


Word16[10] 
Word16[10] 
Q pIsfState 


old LSP vector 

old quantized LSP vector 

see below in this table 


Q_plsfState 


past_rq 


Word16[10] 


past quantized LSF prediction error 


cILtpState 


pitchSt 


Pitch frState 


see below in this table 


tonStabState 


count 

gp 


Word 16 
Word16[7] 


count consecutive (potential) resonance frames 
pitch gain history 


Pitch frState 


TO prev subframe 


Word 16 


integer, pitch lag of previous subframe 


gainQuantState 


sfO_exp_gcodeO 

sfOJrac_gcodeO 

sfO_expJarget_en 

sfOJracJarget_en 

sfO_exp_coeff 

sfOJrac_coeff 

gainjdx_ptr 

gc_predSt 

gc predUncSt 

adaptSt 


Word 16 
Word 16 
Word 16 
Word 16 
Word16[5] 
Word16[5] 
Word 16* 
gc_predState 
gc_predState 
GainAdaptState 


subframe 0/2 codebook gain exponent 
subframe 0/2 codebook gain fraction 
subframe 0/2 target energy exponent 
subframe 0/2 target energy fraction 
subframe 0/2 energy coefficient exponents 
subframe 0/2 energy coefficient fractions 
pointer to gain index value in parameter frame 
see below in this table 
see below in this table 
see below in this table 


gc_predState 


past_qua_en 
past_qua_en_l\/lR1 22 


Word16[4] 
Word16[4] 


MA predictor memory (20*log10(pred. error)) 
MA predictor memory, 12.2 style (log2(pred. 
error)) 


GainAdaptState 


onset 

prev_alpha 
prev_gc 
ltpg_mem 


Word 16 
Word 16 
Word 16 
Word16[5] 


onset counter 
previous adaptor output 
previous codebook gain 
pitch gain history 
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Struct name 


Variable 


Type[Length] 


Description 


pitchOLWghtState 


old_TO_med 
ada w 


Word 16 
Word 16 


weighted open loop pitch lag 

weigthing level depeding on open loop pitch gain 




wght fig 


Word 16 


switches lag weighting on and off 



Table 8: Speech decoder static variables 



Struct name 


Variable 


Type[Lengtli] 


Description 


Speech_Decode_Fram 


decoder_amrState 


Decoder_amrState 


see below in this table 


estate 










post_state 


Post_FilterState 


see below in this table 




postHP_state 


Post_ProcessState 


see below in this table 




ComplexityCounter 


int 


Used for wMOPS counting 


Decoder_amrState 


old_exc 


Word16[194] 


excitation vector 




exc 


Word 16* 


current excitation 




lsp_old 


Word16[10] 


LSP vector of previous frame 




mem_syn 


Word16[10] 


synthesis filter memory 




sharp 


Word 16 


pitch sharpening gain 




old_TO 


Word 16 


pitch sharpening lag 




prev_bf 


Word 16 


previous value of "bad frame" flag 




prev_pdf 


Word 16 


previous value of "pot. dangerous frame" flag 




state 


Word 16 


ECU state (0..6) 




excEnergyHist 


Word16[9] 


excitation energy history 




TOJagBuff 


Word 16 


received pitch lag for ECU 




InBackgroundNoise 


Word 16 


background noise flag 




voicedHangover 


Word 16 


hangover flag 




ItpGainHlstory 


Word16[9] 


pitch gain history 




background_state 


Bgn_scdState 


see below in this table 




Cb_gain_averState 


Cb_gain_averageStat 


see below in this table 




Isp avg St 


e 
lsp_avgState 


see below in this table 




IsfState 


D_plsfState 


see below in this table 




ec_gain_p_st 


ec_gain_pitchState 


see below in this table 




ec_gain_c_st 


ec_gain_codeState 


see below in this table 




pred_state 


go predState 


see table 7 




nodataSeed 


Word 16 


seed for CN generator 




ph_disp_st 


ph_dispState 


see below in this table 




dtxDecoderState 


dtx decState 


see below in this table 


dtx_decState 


since_last_sid 


Word 16 


number of frames since last SID frame 




true_sidjDeriod_inv 


Word 16 


inverse of true SID update rate 




log_en 


Word 16 


logarithmic frame energy 




old_log_en 


Word 16 


previous value of log_en 




L_pn_seed_rx 


Word32 


random number generator seed 




Isp 


Word16[10] 


LSP vector 




Isp old 


Word16[10] 


previous LSP vector 




lsf_hist 


Word16[80] 


LSP vector history (8 frames) 




lsf_hist_ptr 


Word 16 


index to beginning of LSP history 




lsf_hist_mean 


Word16[80] 


mean-removed LSP history (8 frames) 




log_pg_mean 


Word 16 


mean-removed logarithmic prediction gain 




log_en_hist 


Word16[8] 


logarithmic frame energy history 




log_en_hist_ptr 


Word 16 


index to beginning of log, frame energy 
history 




log_en_adjust 


Word 16 


mode-dependent frame energy adjustment 




dtxHangoverCount 


Word 16 


counts down in hangover period 




decAnaElapsedCount 


Word 16 


counts elapsed speech frames after DTX 




sid_frame 


Word 16 


flags SID frames 




valid_data 


Word 16 


flags SID frames containing valid data 




dtxHangoverAdded 


Word 16 


flags hangover period at end of speech 




dtxGlobalState 


enum DTXStatelype 


DTX state flags 




data updated 


Word 16 


flags CNI updates 


BgnscdState 


frameEnergyHist 


Word16[60] 


history of synthesis frame energy 




bgHangover 


Word 16 


number of frames since last speech frame 
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Struct name 


Variable 


Type[Lengtli] 


Description 


Cb_gain_averageState 


cbGainHistory 
hangVar 


Word16[7] 
Word 16 


codebook gain history 

counts length of talkspurt in subframes 




hangCount 


Word 16 


number of subframes since last talkspurt 


Isp avgState 


Isp meanSave 


Word16[10] 


averaged LSP vector 


D_plsfState 


past_r_q 
past Isf q 


Word16[10] 
Word16[10] 


past quantized LSF prediction vector 
past dequantized LSF vector 


ec_gain_pitchState 


pbuf 

past_gain_pit 
prev gp 


Word16[5] 
Word 16 
Word 16 


pitch gain history 

previous pitch gain (limited to 1 .0) 

previous good pitch gain 


ec_gain_codeState 
ph_dispState 


gbuf 

past_gain_code 

prevgc 

gainMem 

prevState 

prevCbGain 

lockFull 

onset 


Word16[5] 
Word 16 
Word 16 
Word16[5] 
Word 16 
Word 16 
Word 16 
Word 16 


codebook gain history 

previous codebook gain 

previous good codebook gain 

pitch gain history 

previously used impulse response 

previous codebook gain 

force maximum phase dispersion 

onset counter 


Post_FilterState 


res2 

mem_syn_pst 
synth_buf 
agc_state 
preemph state 


Word16[40] 

Word16[10] 

Word16[170] 

agcState 

preemphasisState 


LP residual 

synthesis filter memory 
synthesis filter work area 
see below in this table 
see below in this table 


agcState 


past gain 


Word 16 


past age gain 


preemphasisState 


mem_pre 


Word 16 


filter state 


Post_ProcessState 


y2 hi 
y2 lo 
y1 hi 
y1 lo 
xO 
x1 


Word 16 
Word 16 
Word 16 
Word 16 
Word 16 
Word 16 


filter state, upper word 
filter state, lower word 
filter state, upper word 
filter state, lower word 
filter state 
filter state 



Homing procedure 



The principles of the homing procedures are described in [2]. This specification only includes a detailed description of 
the 8 decoder homing frames. For each AMR codec mode, the corresponding decoder homing frame has a fixed set of 
speech parameters shown in table 9a-9h. The bit allocation within these parameters is identical to the corresponding bit 
allocation of the source encoder output parameters given in [2] . 

In the following tables, the following naming convention is used for the individual parameters. Letters in italics indicate 
numbers. 



LPC_n 

LTP-LAG m 

LTP-GAIN m 

FCB-GAIN m 

GAIN_VQ m 

POS m_n 

POS m_n_k 

POS m_n_k_l^ 

SIGN m_n_k 

SIGN m_ n_kJJ 

SIGN_m_n_/t_POS_OT_n 



index of nth LSF submatrix. 

adaptive codebook index for subframe m. 

adaptive codebook gain index in subframe m. 

fixed codebook gain index in subframe m. 

codebook gain VQ index in subframe m (subframe m and m+1 for MR475). 

position index of nth pulse in subframe m. 

position index of nth and A:th pulse in subframe m. 

position index of nth, Ath, fth, andjth pulse in subframe m. 

sign information for nth and A:th pulse in subframe m. 

sign information for nth, feth, fth, andyth pulse in subframe m. 

sign information for nth and ^th pulse and position index for nth pulse in subframe m. 
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Table 9a: Parameter values for the decoder homing frame (MR475) 



Parameter 


Value (LSB=b0) 


LPC1 


OxOOFS 


LPC2 


0x009D 


LPC3 


0x001 C 


LIP-LAG 1 


0x0066 


POS 1 1 2 


0x0000 


SIGN 1 1 2 


0x0003 


GAIN-VQ 1 


0x0028 


LIP-LAG 2 


OxOOOF 


POS 2 1 2 


0x0038 


SIGN 2 1 2 


0x0001 


LIP-LAG 3 


OxOOOF 


POS 3 1 2 


0x0031 


SIGN 3 1 2 


0x0002 


GAIN-VQ 3 


0x0008 


LIP-LAG 4 


OxOOOF 


POS 4 1 2 


0x0026 


SIGN 4 1 2 


0x0003 



Table 9b: Parameter values for the decoder homing frame (MR515) 



Parameter 


Value (LSB=b0) 


LPC1 


0x00F8 


LPG2 


0x009D 


LPG3 


0x001 


LIP-LAG 1 


0x0066 


POS 1 1 2 


0x0000 


SIGN 1 1 2 


0x0003 


GAIN-VQ 1 


0x0037 


LIP-LAG 2 


OxOOOF 


POS 2 1 2 


0x0000 


SIGN 2 1 2 


0x0003 


GAIN-VQ 2 


0x0005 


LIP-LAG 3 


OxOOOF 


POS 3 1 2 


0x0037 


SIGN 3 1 2 


0x0003 


GAIN-VQ 3 


0x0037 


LIP-LAG 4 


OxOOOF 


POS 4 1 2 


0x0023 


SIGN 4 1 2 


0x0003 


GAIN-VQ 4 


0x001 F 
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Table 9c: Parameter values for the decoder homing frame (MR59) 



Parameter 


Value (LSB=b0) 


LPC1 


OxOOFS 


LPC2 


0x00E3 


LPC3 


0x002F 


LIP-LAG 1 


OxOOBD 


POS 1 1 2 


0x0000 


SIGN 1 1 2 


0x0003 


GAIN-VQ 1 


0x0037 


LIP-LAG 2 


OxOOOF 


POS 2 1 2 


0x0001 


SIGN 2 1 2 


0x0003 


GAIN-VQ 2 


OxOOOF 


LIP-LAG 3 


0x0060 


POS 3 1 2 


0x00F9 


SIGN 3 1 2 


0x0003 


GAIN-VQ 3 


0x0037 


LIP-LAG 4 


OxOOOF 


POS 4 1 2 


0x0000 


SIGN 4 1 2 


0x0003 


GAIN-VQ 4 


0x0037 



Table 9d: Parameter values for the decoder homing frame (MR67) 



Parameter 


Value (LSB=b0) 


LPC1 


OxOOFS 


LPC2 


0x00E3 


LPG3 


0x002F 


LIP-LAG 1 


OxOOBD 


POS 112 3 


0x0002 


SIGN 112 3 


0x0007 


GAIN-VQ 1 


0x0000 


LIP-LAG 2 


OxOOOF 


POS 2 1 2 3 


0x0098 


SIGN 2 12 3 


0x0007 


GAIN-VQ 2 


0x0061 


LIP-LAG 3 


0x0060 


POS 3 1 2 3 


0x05C5 


SIGN 3 12 3 


0x0007 


GAIN-VQ 3 


0x0000 


LIP-LAG 4 


OxOOOF 


POS 4 1 2 3 


0x0318 


SIGN 4 12 3 


0x0007 


GAIN-VQ 4 


0x0000 
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Table 9e: Parameter values for the decoder homing frame (MR74) 



Parameter 


Value (LSB=b0) 


LPC1 


OxOOFS 


LPC2 


0x00E3 


LPC3 


0x002F 


LIP-LAG 1 


OxOOBD 


POS 112 3 4 


0x0006 


SIGN 112 3 4 


OxOOOF 


GAIN-VQ 1 


0x0000 


LIP-LAG 2 


0x001 B 


POS 2 12 3 4 


0x0208 


SIGN 2 12 3 4 


OxOOOF 


GAIN-VQ 2 


0x0062 


LIP-LAG 3 


0x0060 


POS 3 12 3 4 


0x1 BA6 


SIGN 3 12 3 4 


OxOOOF 


GAIN-VQ 3 


0x0000 


LIP-LAG 4 


0x001 B 


POS 4 12 3 4 


0x0006 


SIGN 4 12 3 4 


OxOOOF 


GAIN-VQ 4 


0x0000 



Table 9f : Parameter values for the decoder homing frame (MR795) 



Parameter 


Value (LSB=b0) 


LPG1 


0x00C2 


LPC2 


0x00E3 


LPG3 


0x002F 


LIP-LAG 1 


OxOOBD 


POS 112 3 4 


0x0006 


SIGN 112 3 4 


OxOOOF 


LIP-GAIN 1 


OxOOOA 


FOB-GAIN 1 


0x0000 


LIP-LAG 2 


0x0039 


POS 2 12 3 4 


0x1 COS 


SIGN 2 12 3 4 


0x0007 


LIP-GAIN 2 


OxOOOA 


FCB-GAIN 2 


OxOOOB 


LIP-LAG 3 


0x0063 


POS 3 12 3 4 


0x1 1A6 


SIGN 3 12 3 4 


OxOOOF 


LIP-GAIN 3 


0x0001 


FCB-GAIN 3 


0x0000 


LIP-LAG 4 


0x0039 


POS 4 12 3 4 


0x09A0 


SIGN 4 12 3 4 


OxOOOF 


LIP-GAIN 4 


0x0002 


FCB-GAIN 4 


0x0001 
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Table 9g: Parameter values for the decoder homing frame (MR102) 



Parameter 


Value (LSB=b0) 


LPC1 


0x00F8 


LPC2 


0x00E3 


LPC3 


0x002F 


LIP-LAG 1 


0x0045 


SIGN 1 1 5 


0x0000 


SIGN 1 2 6 


0x0000 


SIGN 1 3 7 


0x0000 


SIGN 1 4 8 


0x0000 


POS 112 5 


0x0000 


POS 1 3 6 7 


0x0000 


POS 1 4 8 


0x0000 


GAIN-VQ 1 


0x0000 


LIP-LAG 2 


0x001 B 


SIGN 2 1 5 


0x0000 


SIGN 2 2 8 


0x0001 


SIGN 2 3 7 


0x0000 


SIGN 2 4 8 


0x0001 


POS 2 12 5 


0x0326 


POS 2 3 6 7 


OxOOCE 


POS 2 4 8 


0x007E 


GAIN-VO 2 


0x0051 


LIP-LAG 3 


0x0062 


SIGN 3 1 5 


0x0000 


SIGN 3 2 6 


0x0000 


SIGN 3 3 7 


0x0000 


SIGN 3 4 8 


0x0000 


POS 3 12 5 


0x01 5A 


POS 3 3 6 7 


0x0359 


POS 3 4 8 


0x0076 


GAIN-VO 3 


0x0000 


LIP-LAG 4 


0x001 B 


SIGN 4 1 5 


0x0000 


SIGN 4 2 6 


0x0000 


SIGN 4 3 7 


0x0000 


SIGN 4 4 8 


0x0000 


POS 4 12 5 


0x01 7C 


POS 4 3 6 7 


0x0215 


POS 4 4 8 


0x0038 


GAIN-VO 4 


0x0030 
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Table 9h: Parameter values for the decoder homing frame (MR122) 



Parameter 


Value (LSB=bO) 


LPC1 


0x0004 


LPC2 


0x002A 


LPC3 


OxOODB 


LPC4 


0x0096 


LPC5 


0x002A 


LTP-LAG 1 


0x0156 


LIP-GAIN 1 


OxOOOB 


SIGN 1 1 6 POS 1 1 


0x0000 


SIGN 1 2 7 POS 1 2 


0x0000 


SIGN 1 3 8 POS 1 3 


0x0000 


SIGN 1 4 9 POS 1 4 


0x0000 


SIGN 1 5 10 POS 1 5 


0x0000 


P0S1 6 


0x0000 


P0S1 7 


0x0000 


P0S1 8 


0x0000 


P0S1 9 


0x0000 


P0S1 10 


0x0000 


FCB-GAIN 1 


0x0000 


LTP-LAG 2 


0x0036 


LIP-GAIN 2 


OxOOOB 


SIGN 2 1 6 POS 2 1 


0x0000 


SIGN 2 2 7 POS 2 2 


OxOOOF 


SIGN 2 3 8 POS 2 3 


OxOOOE 


SIGN 2 4 9 POS 2 4 


OxOOOC 


SIGN 2 5 10 POS 2 5 


OxOOOD 


POS 2 6 


0x0000 


POS 2 7 


0x0001 


POS 2 8 


0x0005 


POS 2 9 


0x0007 


POS 2 10 


0x0001 


FCB-GAIN 2 


0x0008 


LTP-LAG 3 


0x0024 


LIP-GAIN 3 


0x0000 


SIGN 3 1 6 POS 3 1 


0x0001 


SIGN 3 2 7 POS 3 2 


0x0000 


SIGN 3 3 8 POS 3 3 


0x0005 


SIGN 3 4 9 POS 3 4 


0x0006 


SIGN 3 5 10 POS 3 5 


0x0001 


POS 3 6 


0x0002 


POS 3 7 


0x0004 


POS 3 8 


0x0007 


POS 3 9 


0x0004 


POS 3 10 


0x0002 


FCB-GAIN 3 


0x0003 


LTP-LAG 4 


0x0036 


LIP-GAIN 4 


OxOOOB 


SIGN 4 1 6 POS 4 1 


0x0000 


SIGN 4 2 7 POS 4 2 


0x0002 


SIGN 4 3 8 POS 4 3 


0x0004 


SIGN 4 4 9 POS 4 4 


0x0000 


SIGN 4 5 10 POS 4 5 


0x0003 


POS 4 6 


0x0006 


POS 4 7 


0x0001 


POS 4 8 


0x0007 


POS 4 9 


0x0006 


POS 4 10 


0x0005 


FCB-GAIN 4 


0x0000 
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6 File formats 

This section describes the file formats used by the encoder and decoder programs. The test sequences defined in [1] also 
use the file formats described here. 

6.1 Speech file (encoder input / decoder output) 

Spech files read by the encoder and written by the decoder consist of 16-bit words where each word contains a 13-bit, 
left aligned speech sample. The byte order depends on the host architecture (e.g. MSByte first on SUN workstations, 
LSByte first on PCs etc.). Both the encoder and the decoder program process complete frames (of 160 samples) only. 

This means that the encoder will only process n frames if the length of the input file is n'*160 + k words, while the files 
produced by the decoder will always have a length of n* 160 words. 

6.2 Mode control file (encoder input) 

The encoder program can optionally read in a mode control file which specifies the encoding mode for each frame of 
speech processed. The file is a text file containing one line per speech frame. Each line contains one of the mode names 
from the list {MR475, MR515, MR59, MR67, MR74, MR795, MR102, MR122}. 

6.3 Parameter bitstream file (encoder output / decoder input) 

The files produced by the speech encoder/expected by the speech decoder contain an arbitrary number of frames in the 
following format. 



FRAME_TYPE 


Bl 


B2 




B244 


MODE_INFO 


unusedl 




unused4 



Each box corresponds to one Wordl 6 value in the bitstream file, for a total of 250 words or 500 bytes per frame. The 
fields have the following meaning: 



FRAME_TYPE 


transmit frame 


type, 




TX_SPEECH 


(0x0000) 




TX_SID_FIRST 


(0x0001) 




TX_SID_UPDATE 


(0x0002) 




TX_NO_DATA 


(0x0003) 



which 



IS 



one 



of 



50...B244 



speech encoder parameter bits (i.e. the bitstream itself). Each bx either has the 
value 0x0000 or OxOOOi. Only mode mri22 really uses all 244 bits; for the other 

modes, only the first n bits are used (35 < n < 204). The remaining bits are unused 
(written as 0x0 0) 



MODE_INFO 


encoding 


mode 


information 




MR4 7 5 




(0x0000) 




MRS 15 




(0x0001) 




MRS 9 




(0x0002) 




MR67 




(0x0003) 




MR7 4 




(0x0004) 




MR7 9S 




(OxOOOS) 




MR102 




(0x0006) 




MR12 2 




(0x0007) 



which 



IS 



one 



of 



unusedl. ..4 unused, written as OxOOOO 

As indicated in section 6. 1 above, the byte order depends on the host architecture. 
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Annex A (infonnative): 
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